147 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech
Availability:
From Data Center(s)
License:
LDC
Size:
50 thousand Production Status:
Existing-updated
Use:
Discourse
Paper:
N/A
Documentation:
http://ufal.mff.cuni.cz/pdt2.0/doc/pdt-guide/en/html/ch05.htmlLanguage Type:
Multilingual
Languages:
Czech
Availability:
<Not Specified>
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech
Availability:
Freely Available
License:
Creative Commons BY-NC-SA
Size:
1957247 tokens Production Status:
Existing-updated
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
Yes, English and Czech, Yes. http://ufal.mff.cuni.cz/pdt2.5/en/documentation.html
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech Romanian Slovak Spanish Vietnamese
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Czech English German Russian Spanish
Availability:
Not Available
License:
<Not Specified>
Size:
2k terms per language Production Status:
Newly created-in progress
Use:
Opinion Mining/Sentiment Analysis
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Bulgarian Czech English Hungarian Romanian
Availability:
From Data Center(s)
License:
ELRA
Size:
75Mbyte Production Status:
Existing-used
Use:
POS Induction
Paper:
N/A
Documentation:
English
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English German Hindi Italian Persian
Availability:
Freely Available
License:
Creative Commons - Attribution-{NonCommercial}-{ShareAlike} 4.0 International ({CC} {BY}-{NC}-{SA} 4.0)
Size:
162M sentences Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:LSCP: Enhanced Large Scale Colloquial Persian Language Understanding
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mahdi Bohlouli | Large-Scale Colloquial Persian 0.5 | /N |
Documentation:
None
Written
Treebank,
Language Type:
Monolingual
Languages:
Afrikaans Akkadian Amharic Ancient Greek Arabic Armenian Assyrian Bambara Basque Belarusian Bhojpuri Breton Bulgarian Buryat Cantonese Catalan Chinese Classical Chinese Coptic Croatian Czech Danish Dutch English Erzya Estonian Faroese Finnish French Galician German Gothic Greek Hebrew Hindi Hindi English Hungarian Indonesian Irish Italian Japanese Karelian Kazakh Komi Permyak Komi Zyrian Korean Kurmanji Latin Latvian Lithuanian Livvi Maltese Marathi Mbya Guarani Moksha Naija North Sami Norwegian Old Church Slavonic Old French Old Russian Persian Polish Portuguese Romanian Russian Sanskrit Scottish Gaelic Serbian Skolt Sami Slovak Slovenian Spanish Swedish Swedish Sign Language Swiss German Tagalog Tamil Telugu Thai Turkish Ukrainian Upper Sorbian Urdu Uyghur Vietnamese Warlpiri Welsh Wolof Yoruba
Availability:
Freely Available
License:
Various
Size:
25 million words Production Status:
Existing-updated
Use:
Parsing and Tagging
-
Paper title:Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joakim Nivre | Universal Dependencies | /N |
Documentation:
https://universaldependencies.org
Written
Corpus,
Language Type:
Monolingual
Languages:
Czech
Availability:
under CreativeCommons License
License:
CreativeCommons
Size:
73647 tokens Production Status:
Newly created-in progress
Use:
Named Entity Recognition
-
Paper title:Czech Historical Named Entity Corpus v 1.0
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Helena Hubková | Czech Historical Named Entity Corpus v 1.0 | /N |
Documentation:
Annotation_manual, README
Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic Chinese Czech English Finnish French German Hindi Indonesian Italian Japanese Korean Polish Portuguese Russian Spanish Swedish Thai Turkish
Availability:
Freely Available
License:
CC-BY-SA
Size:
300 KByte Production Status:
Newly created-finished
Use:
Emotion Recognition/Generation
-
Paper title:How Universal are Universal Dependencies? Exploiting Syntax for Multilingual Clause-level Sentiment Detection
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hiroshi Kanayama | Parallel Sentiment | /N |
Documentation:
For 19 languages (ar,cs,de,en,es,fi,fr,hi,id,it,ja,ko,pl,pt,ru,sv,th,tr,zh)




